Joint Classifier and Kernel Design
Abstract
From a machine learning perspective, the analysis of gene expression data is complicated by the extremely large feature dimensionality. The presence of a large number of irrelevant features (here, genes) makes such analysis prone to the curse of dimensionality. To overcome this limitation, Support Vector Machines (SVMs) are widely employed, since they are well known to generalize well even in the presence of irrelevant predictor variables. Motivated by error bounds from computational learning theory, we present a Bayesian generalization of the SVM that jointly learns the optimal classifier and kernel from the data. Theoretical and experimental results are provided to show that learning the kernel results in automatic feature selection and hence mitigates the problem of large dimensionality.

I. EXTENDED ABSTRACT

In the traditional pattern recognition literature, the problem of cancer diagnosis using the gene expression profile of a new tissue sample and a database of previous expression profiles and their diagnoses falls under the general class of supervised pattern recognition. Given a database of training samples from $N$ tissues, we have a set of $N$ expression profiles $x^{(i)}$ indexed by $i \in \{1, 2, \ldots, N\}$. Each expression profile $x^{(i)} = [x_1^{(i)}, x_2^{(i)}, \ldots, x_d^{(i)}] \in \mathbb{R}^d$ is a $d$-dimensional vector representing the measured expression levels of $d$ genes in the tissue sample. The class membership of each database sample is known and is denoted by $y^{(i)}$. In a two-class case (e.g., the tissues are either cancerous or non-cancerous), we can assume without loss of generality that $y^{(i)} \in \{0, 1\}$. Thus, the training set $D$ consists of $N$ pairs of expression profiles and their corresponding class membership labels:

$D = \{ \langle x^{(i)}, y^{(i)} \rangle : x^{(i)} \in \mathbb{R}^d,\ y^{(i)} \in \{0, 1\} \}_{i=1}^{N}$
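The training-set notation and the link between kernel learning and feature selection can be illustrated with a small sketch. The abstract does not state how the kernel is parameterized, so the per-gene weights `theta` below are an assumption made purely for illustration, as are the helper names `make_training_set` and `scaled_linear_kernel` and the synthetic data. The sketch shows only that a gene whose weight is zero drops out of the kernel entirely. It does not implement the Bayesian learning procedure itself.

```python
import numpy as np


def make_training_set(n_samples, n_genes, seed=0):
    """Synthetic stand-in for D: X has shape (N, d), y holds labels in {0, 1}."""
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(n_samples, n_genes))   # hypothetical expression levels
    y = rng.integers(0, 2, size=n_samples)      # hypothetical class labels
    return X, y


def scaled_linear_kernel(X1, X2, theta):
    """K[i, j] = sum_k theta[k] * X1[i, k] * X2[j, k], with theta[k] >= 0.

    The per-feature weights theta are an assumed parameterization,
    not taken from the source.
    """
    theta = np.asarray(theta, dtype=float)
    if np.any(theta < 0):
        raise ValueError("theta must be non-negative")
    return (X1 * theta) @ X2.T


X, y = make_training_set(n_samples=6, n_genes=5)

# Suppose learning drove the weights of genes 1, 3 and 4 to zero.
theta = np.array([1.0, 0.0, 2.0, 0.0, 0.0])
K_full = scaled_linear_kernel(X, X, theta)

# The same kernel computed from the surviving genes only.
kept = theta > 0
K_reduced = scaled_linear_kernel(X[:, kept], X[:, kept], theta[kept])
```

Under this assumed parameterization, `K_full` and `K_reduced` coincide, so any kernel classifier trained on them never sees the zero-weighted genes. That is the sense in which learning kernel parameters can act as feature selection.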
Similar resources
Object Recognition based on Local Steering Kernel and SVM
The proposed method recognizes objects by applying Local Steering Kernels (LSK) as descriptors to image patches. To represent the local properties of the images, patches are extracted where variations occur in an image. To find the interest points, a wavelet-based salient point detector is used. The Local Steering Kernel is then applied to the resultant pixels, in ...
SUBCLASS FUZZY-SVM CLASSIFIER AS AN EFFICIENT METHOD TO ENHANCE THE MASS DETECTION IN MAMMOGRAMS
This paper is concerned with the development of a novel classifier for automatic mass detection of mammograms, based on contourlet feature extraction in conjunction with statistical and fuzzy classifiers. In this method, mammograms are segmented into regions of interest (ROI) in order to extract features including geometrical and contourlet coefficients. The extracted features benefit from...
Studying Influence of Preheating Conditions on Design Parameters of Continuous Paint Cure Ovens
This paper concentrates on a new procedure which experimentally recognises gears and bearings faults of a typical gearbox system using a least square support vector machine (LSSVM). Two wavelet selection criteria Maximum Energy to Shannon Entropy ratio and Maximum Relative Wavelet Energy are used and compared to select an appropriate wavelet for feature extraction. The fault diagnosis method co...
Linear and Kernel Classification: When to Use Which?
Kernel methods are known to be a state-of-the-art classification technique. Nevertheless, the training and prediction cost is expensive for large data. On the other hand, linear classifiers can easily scale up, but are inferior to kernel classifiers in terms of predictability. Recent research has shown that for some data sets (e.g., document data), linear is as good as kernel classifiers. In su...
The Joint Submission of the TU Berlin and Fraunhofer FIRST (TUBFI) to the ImageCLEF2011 Photo Annotation Task
In this paper we present details on the joint submission of TU Berlin and Fraunhofer FIRST to the ImageCLEF 2011 Photo Annotation Task. We sought to experiment with extensions of Bag-of-Words (BoW) models at several levels and to apply several kernel-based learning methods recently developed in our group. For classifier training we used non-sparse multiple kernel learning (MKL) and an efficient...
An EM Algorithm for Joint Feature Selection and Classifier Design
The problems of accurate classifier design and feature (predictor variable) selection have typically been considered in isolation of one another, with an independent feature selection phase often preceding the training of a classifier. However, the process of feature selection is most effective when it is undertaken with an eye towards retaining those features that are most relevant to the clas...